NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Optimization using pathwise algorithmic derivatives of electromagnetic shower simulations

Aehle, Max; Novák, Mihály; Vassilev, Vassil; Gauger, Nicolas R; Heinrich, Lucas; Kagan, Michael; Lange, David (April 2025, Computer physics communications)

Among the well-known methods to approximate derivatives of expectancies computed by Monte-Carlo simulations, averages of pathwise derivatives are often the easiest one to apply. Computing them via algorithmic differentiation typically does not require major manual analysis and rewriting of the code, even for very complex programs like simulations of particle-detector interactions in high-energy physics. However, the pathwise derivative estimator can be biased if there are discontinuities in the program, which may diminish its value for applications. This work integrates algorithmic differentiation into the electromagnetic shower simulation code HepEmShow based on G4HepEm, allowing us to study how well pathwise derivatives approximate derivatives of energy depositions in a sampling calorimeter with respect to parameters of the beam and geometry. We found that when multiple scattering is disabled in the simulation, means of pathwise derivatives converge quickly to their expected values, and these are close to the actual derivatives of the energy deposition. Additionally, we demonstrate the applicability of this novel gradient estimator for stochastic gradient-based optimization in a model example.
more » « less
Free, publicly-accessible full text available April 1, 2026
The 200 Gbps Challenge: Imagining HL-LHC analysis facilities

https://doi.org/10.1051/epjconf/202533701217

Held, Alexander; Albin, Sam; Attebury, Garhan; Bloom, Kenneth; Bockelman, Brian; Bryant, Lincoln; Choi, Kyungeon; Cranmer, Kyle; Elmer, Peter; Feickert, Matthew; et al (October 2025, EPJ Web of Conferences)
Szumlak, T; Rachwał, B; Dziurda, A; Schulz, M; vom_Bruch, D; Ellis, K; Hageboeck, S (Ed.)
The IRIS-HEP software institute, as a contributor to the broader HEP Python ecosystem, is developing scalable analysis infrastructure and software tools to address the upcoming HL-LHC computing challenges with new approaches and paradigms, driven by our vision of what HL-LHC analysis will require. The institute uses a “Grand Challenge” format, constructing a series of increasingly large, complex, and realistic exercises to show the vision of HL-LHC analysis. Recently, the focus has been demonstrating the IRIS-HEP analysis infrastructure at scale and evaluating technology readiness for production. As a part of the Analysis Grand Challenge activities, the institute executed a “200 Gbps Challenge”, aiming to show sustained data rates into the event processing of multiple analysis pipelines. The challenge integrated teams internal and external to the institute, including operations and facilities, analysis software tools, innovative data delivery and management services, and scalable analysis infrastructure. The challenge showcases the prototypes — including software, services, and facilities — built to process around 200 TB of data in both the CMS NanoAOD and ATLAS PHYSLITE data formats with test pipelines. The teams were able to sustain the 200 Gbps target across multiple pipelines. The pipelines focusing on event rate were able to process at over 30 MHz. These target rates are demanding; the activity revealed considerations for future testing at this scale and changes necessary for physicists to work at this scale in the future. The 200 Gbps Challenge has established a baseline on today’s facilities, setting the stage for the next exercise at twice the scale.
more » « less
Free, publicly-accessible full text available October 7, 2026
Fast And Automatic Floating Point Error Analysis With CHEF-FP

https://doi.org/10.1109/IPDPS54959.2023.00105

Singh, Garima; Kundu, Baidyanath; Menon, Harshitha; Penev, Alexander; Lange, David J.; Vassilev, Vassil (May 2023, 2023 IEEE International Parallel and Distributed Processing Symposium)

As we reach the limit of Moore’s Law, researchers are exploring different paradigms to achieve unprecedented performance. Approximate Computing (AC), which relies on the ability of applications to tolerate some error in the results to trade-off accuracy for performance, has shown significant promise. Despite the success of AC in domains such as Machine Learning, its acceptance in High-Performance Computing (HPC) is limited due to its stringent requirement of accuracy. We need tools and techniques to identify regions of the code that are amenable to approximations and their impact on the application output quality so as to guide developers to employ selective approximation. To this end, we propose CHEF-FP, a flexible, scalable, and easy-to-use source-code transformation tool based on Automatic Differentiation (AD) for analysing approximation errors in HPC applications. CHEF-FP uses Clad, an efficient AD tool built as a plugin to the Clang compiler and based on the LLVM compiler infrastructure, as a backend and utilizes its AD abilities to evaluate approximation errors in C++ code. CHEF-FP works at the source level by injecting error estimation code into the generated adjoints. This enables the error-estimation code to undergo compiler optimizations resulting in improved analysis time and reduced memory usage. We also provide theoretical and architectural augmentations to source code transformation-based AD tools to perform FP error analysis. In this paper, we primarily focus on analyzing errors introduced by mixed-precision AC techniques, the most popular approximate technique in HPC. We also show the applicability of our tool in estimating other kinds of errors by evaluating our tool on codes that use approximate functions. Moreover, we demonstrate the speedups achieved by CHEF-FP during analysis time as compared to the existing state-of-the-art tool as a result of its ability to generate and insert approximation error estimate code directly into the derivative source. The generated code also becomes a candidate for better compiler optimizations contributing to lesser runtime performance overhead.
more » « less
Full Text Available
GPU Accelerated Automatic Differentiation With Clad

Ifrim, Ioana; Vassilev, Vassil; Lange, David (October 2022, 20th International Workshop on Advanced Computing and Analysis Techniques in Physics Research)

Automatic Differentiation (AD) is instrumental for science and industry. It is a tool to evaluate the derivative of a function specified through a computer program. The range of AD application domain spans from Machine Learning to Robotics to High Energy Physics. Computing gradients with the help of AD is guaranteed to be more precise than the numerical alternative and have a low, constant factor more arithmetical operations compared to the original function. Moreover, AD applications to domain problems typically are computationally bound. They are often limited by the computational requirements of high-dimensional parameters and thus can benefit from parallel implementations on graphics processing units (GPUs). Clad aims to enable differential analysis for C/C++ and CUDA and is a compiler-assisted AD tool available both as a compiler extension and in ROOT. Moreover, Clad works as a plugin extending the Clang compiler; as a plugin extending the interactive interpreter Cling; and as a Jupyter kernel extension based on xeus-cling. We demonstrate the advantages of parallel gradient computations on GPUs with Clad. We explain how to bring forth a new layer of optimization and a proportional speed up by extending Clad to support CUDA. The gradients of well-behaved C++ functions can be automatically executed on a GPU. The library can be easily integrated into existing frameworks or used interactively. Furthermore, we demonstrate the achieved application performance improvements, including (~10x) in ROOT histogram fitting and corresponding performance gains from offloading to GPUs.
more » « less
Full Text Available
AwkwardForth: accelerating Uproot with an internal DSL

https://doi.org/10.1051/epjconf/202125103002

Pivarski, Jim; Osborne, Ianna; Das, Pratyush; Lange, David; Elmer, Peter (January 2021, EPJ Web of Conferences)
Biscarat, C.; Campana, S.; Hegner, B.; Roiser, S.; Rovelli, C.I.; Stewart, G.A. (Ed.)
File formats for generic data structures, such as ROOT, Avro, and Parquet, pose a problem for deserialization: it must be fast, but its code depends on the type of the data structure, not known at compile-time. Just-in-time compilation can satisfy both constraints, but we propose a more portable solution: specialized virtual machines. AwkwardForth is a Forth-driven virtual machine for deserializing data into Awkward Arrays. As a language, it is not intended for humans to write, but it loosens the coupling between Uproot and Awkward Array. AwkwardForth programs for deserializing record-oriented formats (ROOT and Avro) are about as fast as C++ ROOT and 10–80× faster than fastavro. Columnar formats (simple TTrees, RNTuple, and Parquet) only require specialization to interpret metadata and are therefore faster with precompiled code.
more » « less
Full Text Available
Nested data structures in array frameworks

https://doi.org/10.1088/1742-6596/1525/1/012053

Pivarski, Jim; Lange, David; Elmer, Peter (April 2020, Journal of Physics: Conference Series)

Full Text Available
Deep learning-based automated image segmentation for concrete petrographic analysis

https://doi.org/10.1016/j.cemconres.2020.106118

Song, Yu; Huang, Zilong; Shen, Chuanyue; Shi, Humphrey; Lange, David A. (September 2020, Cement and Concrete Research)

Full Text Available
Inexpensive Multipatient Respiratory Monitoring System for Helmet Ventilation During COVID-19 Pandemic

https://doi.org/10.1115/1.4053386

Bourrianne, Philippe; Chidzik, Stanley; J. Cohen, Daniel; Elmer, Peter; Hallowell, Thomas; Kilbaugh, Todd J.; Lange, David; Leifer, Andrew M.; Marlow, Daniel R.; Meyers, Peter D.; et al (March 2022, Journal of Medical Devices)

Abstract Helmet continuous positive applied pressure is a form of noninvasive ventilation (NIV) that has been used to provide respiratory support to COVID-19 patients. Helmet NIV is low-cost, readily available, provides viral filters between the patient and clinician, and may reduce the need for invasive ventilation. Its widespread adoption has been limited, however, by the lack of a respiratory monitoring system needed to address known safety vulnerabilities and to monitor patients. To address these safety and clinical needs, we developed an inexpensive respiratory monitoring system based on readily available components suitable for local manufacture. Open-source design and manufacturing documents are provided. The monitoring system comprises flow, pressure, and CO2 sensors on the expiratory path of the helmet circuit and a central remote station to monitor up to 20 patients. The system is validated in bench tests, in human-subject tests on healthy volunteers, and in experiments that compare respiratory features obtained at the expiratory path to simultaneous ground-truth measurements from proximal sensors. Measurements of flow and pressure at the expiratory path are shown to deviate at high flow rates, and the tidal volumes reported via the expiratory path are systematically underestimated. Helmet monitoring systems exhibit high-flow rate, nonlinear effects from flow and helmet dynamics. These deviations are found to be within a reasonable margin and should, in principle, allow for calibration, correction, and deployment of clinically accurate derived quantities.
more » « less
Full Text Available
A Roadmap for HEP Software and Computing R&D for the 2020s

https://doi.org/10.1007/s41781-018-0018-8

Albrecht, Johannes; Alves, Antonio Augusto; Amadio, Guilherme; Andronico, Giuseppe; Anh-Ky, Nguyen; Aphecetche, Laurent; Apostolakis, John; Asai, Makoto; Atzori, Luca; Babik, Marian; et al (December 2019, Computing and Software for Big Science)

Full Text Available
A new calibration method for charm jet identification validated with proton-proton collision events at √s = 13 TeV

https://doi.org/10.1088/1748-0221/17/03/P03014

Tumasyan, Armen; Adam, Wolfgang; Andrejkovic, Janik Walter; Bergauer, Thomas; Chatterjee, Suman; Dragicevic, Marko; Escalante Del Valle, Alberto; Fruehwirth, Rudolf; Jeitler, Manfred; Krammer, Natascha; et al (March 2022, Journal of Instrumentation)

Abstract Many measurements at the LHC require efficient identification of heavy-flavour jets, i.e. jets originating from bottom (b) or charm (c) quarks. An overview of the algorithms used to identify c jets is described and a novel method to calibrate them is presented. This new method adjusts the entire distributions of the outputs obtained when the algorithms are applied to jets of different flavours. It is based on an iterative approach exploiting three distinct control regions that are enriched with either b jets, c jets, or light-flavour and gluon jets. Results are presented in the form of correction factors evaluated using proton-proton collision data with an integrated luminosity of 41.5 fb -1 at √s = 13 TeV, collected by the CMS experiment in 2017. The closure of the method is tested by applying the measured correction factors on simulated data sets and checking the agreement between the adjusted simulation and collision data. Furthermore, a validation is performed by testing the method on pseudodata, which emulate various mismodelling conditions. The calibrated results enable the use of the full distributions of heavy-flavour identification algorithm outputs, e.g. as inputs to machine-learning models. Thus, they are expected to increase the sensitivity of future physics analyses.
more » « less
Full Text Available

Search for: All records